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CROSS REFERENCE TO RELATED APPLICATIONS 

This application is related to and incorporates by reference herein in its entirety 
the commonly owned and concurrently filed U.S. Patent Application Attorney Docket 
Number M-9477 US, entitled "PORTABLE BROWSER DEVICE WITH ADAPTIVE 
PERSONALIZATION CAPABILITY" by Claude-Nicolas Fiechter, Amir Ben-Efraim, 
Tea HeaNahm, and David Hudson filed on November 21, 2000 (hereinafter "the M-9477 
application"). 

This application is also related to and incorporates by reference herein in its 
entirety the copending, commonly owned U.S. Patent Application Serial No. 09/415,295, 
entitled "Portable Browser Device With Voice Recognition And Feedback Capability," 
filed October 8, 1999 (hereafter "the '295 application"). 



CROSS REFERENCE TO ATTACHED APPENDIX 

Appendix A contains the following files in one CD-ROM (of which two identical 
copies are attached hereto), and is a part of the present disclosure and is incorporated by 
reference herein in its entirety: 

Volume in drive E is 00121 1_1324 
Volume Serial Number is 1A53-A390 

Directory of E:\ 

12/11/00 01:25p <DIR> 
12/11/00 01:25p <DIR> 
12/11/00 01:25p <DIR> WML 
3 File(s) 0 bytes 
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Directory of E:\WML 
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10 



12/11/00 01:25p 
12/11/00 01:25p 
12/11/00 01:25p 
05/18/00 03:05a 
05/19/00 05:27a 
05/18/00 03:05a 
12/11/00 01:25p 
12/11/00 01:25p 
8 File(s) 



<DIR> 
<DIR> 

<DIR> BIN 

875 HELLO.WML 
1,226 HIBYE.WML 
1,986 LOGO. WML 

<DIR> PLAYLIST 

<DIR> VOS 
4,087 bytes 



Directory of E:\WML\BIN 



12/11/00 01:25p 
12/11/00 01:25p 
08/07/00 02:01p 
08/07/00 12:32p 
4File(s) 



<DIR> 
<DIR> 

1,387 PREPPLAY.PL 
239 README.TXT 
1,626 bytes 



Directory of E:\WML\PLAYLIST 



12/11/00 01:25p 
12/11/00 01:25p 
12/08/00 06:41p 
05/31/00 ll:01p 
12/08/00 06:43p 
05/18/00 05:48a 
12/11/00 01:25p 
12/08/00 06:45p 
12/08/00 06:46p 
12/08/00 06:48p 
06/01/00 11:31a 



<DIR> 
<DIR> 

1,738 BUSINESS. WML 
3,611 EMAIL.WML 
1,719 ENTERT-l-WML 
558 FINANCE.WML 
<DIR> IMAGES 

1,587 MARKET. WML 
2,603 NEWS.WML 
1,640 SPORTS.WML 
1,193 STOCKS.WML 
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05/19/00 U:33a 854 TABLE. WML 

05/18/00 05:48a 558 WEATHER. WML 

09/08/00 02:46p 2,652 WELCOME. WML 

14File(s) 18,713 bytes 

5 

Directory of E:\WML\PLAYLIST\IMAGES 





12/11/00 01:25p 


<DIR> 




12/11/00 01:25p 


<DIR> 


10 


2 File(s) 


0 bytes 




Directory of E:\WML\VOS 




12/11/00 01:25p 


<DrR> 


15 


12/11/00 01:25p 


<DIR> 




08/03/00 03:29p 


89 COMPWAP.BAT 




08/03/00 03:25p 


179 MASTER. VS 




12/08/00 06:39p 


395 README.TXT 




08/03/00 03:38p 


136 RUNWAP.BAT 


20 


08/04/00 07:14p 


691 WAPTEST.VS 




7 File(s) 


1,490 bytes 



Total Files Listed: 

38 File(s) 25,916 bytes 
25 0 bytes free 

The files of Appendix A form soiurce code of computer programs and related data 
of an illustrative embodiment of the present invention. The files welcome.wml, 
news.wml, business.wml, email. wml, entertain. wml, finance.wml, market.wml, 
30 sports.wml, stocks. wml, table.wml, weather. wml, hello. wml, hiBye.wml and logo. wml 
provide WML implementation of visual navigation software in a phone to identify audio 
content for playing. The remaining files contain source code for use in a Personal 
Computer (such as a PC available from Dell Corporation), running the Microsoft NT 
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Operating System. File prepPlayer.pl is for accessing the database to determine the latest 
content for a particular subcategory; installed in a Personal Computer which needs to be 
able to execute Perl CGIs. Specifically, prepPlayer.pl is a PERL script implementation of 
card generation software for use in the Personal Computer to provide telephone number 
5 and dialing instructions to cell phone in response to identification of selected audio 
content by the cell phone. 

Files master. vs, waptest.vs, comp.wap.bat, runwaptest.bat and readme.txt form an 
audio server in a Personal Computer that responds to an incoming voice call from a 
phone. This software reads the content of a file which currently sits on the WAP web 

10 server machine. The just-described file has in it the filename (or names) of the contents 
to be played. This software should be installed on the WAP voice server, and started 
with the runwaptest.bat batch file. This software can be compiled with a VOS compiler 
available from Parity Software of 3 Harbor Drive, Sausalito, California. 

Microsoft SQL server may be used to generate a database for use with the above- 

1 5 described software in the Personal Computer. 

A portion of the disclosure of this patent document contains material which is 
subject to copyright protection. The copyright owner has no objection to the facsimile 
reproduction by anyone of the patent document or the patent disclosure, as it appears in 
the U.S. Patent and Trademark Office patent files or records, but otherwise reserves all 

20 copyright rights whatsoever. 

BACKGROUND 

Certain cellular phones (also called "wireless phones" and "cell phones") have 
the capability to place a phone call and also provide a wireless link to the Internet for the 

25 download of data that is visually displayed to the user. The data is presented on a display 
(such as a liquid crystal display (LCD) or a plasma display) by micro browser software 
that is prograirmied into a memory of the cell phone and executed by a processor (such as 
a digital signal processor) also included in the phone. The data may be displayed in a 
hierarchical manner, wherein a home page contains a number of categories for selection, 

30 and a selected category in turn may contain a mmiber of categories, and so on. At some 
point in the hierarchy, a category contains a number of items of data (also called 
"content"). The data may also be provided in a list, based on use of a search term to find 
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items in the list. Therefore, a user can use a cellular phone to find a telephone number of 
a restaurant, and the user can then place a voice call to speak with an employee of the 
restaurant, in the normal maimer of using a telephone. 

Wireless apphcation protocol (WAP) is a specification (see 
5 http://www.wapforum.org/ what/technical.htm) that enables cell phones to access data 
fi"om the Internet. Such a cell phone interprets files that are written in a tag-based 
language called Wireless Markup Language (WML), which is a DTD of XML 
(extendable Markup Language). Telephony operations in such a device can be 
controlled through a standard called Wireless Telephony Application (WTA). In the 
10 following example, a WML file when executed by a cell phone causes the cell phone to 
dial the phone nvunber 555-1212 for Directory Assistance in response to user selection of 
the link: 

<wml> 

15 <caid> 

<a href="wtai://wp/mc;5551212"> Directory Assistance</a> 

</wml> 

20 For other such mechanisms, see the specification "WAP-170-WTAI" version 07-Jul- 

2000, entitled "Wireless Application Protocol Wireless Telephony Application Interface 
Specification" available fi-om www.wapforum.org. 

Certain cell phones, such as Nokia 7110 do not conform to the WTAI standard. 
However, such phones, may have a different interface. For example, Nokia 7110 allows 

25 selection of a "use number" fimction (fi-om various options) when a number on a page is 
displayed, and when selected automatically disconnects the user firom the Internet and 
sets the phone up for use (by operating a green telephone button to make the call). Other 
Nokia phones appear to allow pressing a "dial" phone button to cause the voice call to be 
initiated using the displayed phone number. 

30 Instead of cell phones, other devices such as personal computers running MS 

Windows NT, and PDAs running Palm OS™ (e.g. Pahn HIc, Hie, EIx, V and Vx 
platforms) may interpret WML pages, by use of WAP simulators, such as WAPman 
browser available fi-om htt p://www.edgematrix.com , WinWAP 3.0 available firom 
http://www.slobtrot.com/eng /index.httnl, and Mobone WAP browser available firom 

35 http://www.mobone.com/wapbrowser. 
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According to EdgeMatrix, the WAPman browser can also be used with fixed-Une 
videophones and in-car Internet devices. Another such device is Qualcomm's pdQ which 
puts an actual version of 3 Corn's Palm PDA interface on the cell phone's screen. See also 
the iPaq Blackberry Handheld available from Compaq Corporation. 
5 A message posted on August 8, 2000 at the Nokia WAP Discussion Board 

suggests that a cell phone can dial a phone number of a voice mail system (an interactive 
voice response system) based on instructions in a WML page, and the message suggests 
the need to dial an additional nimiber after dialing the phone number. Specifically, the 
message asks if there is a way to pass a number (such as 1, 2, or 3) along with a telephone 

10 number. The message suggests the example of dialing 43232135 (telephone) and then 
key in the number "2." Moreover, instead of WML pages, other cell phones may 
interpret Handheld Device Markup Language (HDML) pages that are also available 
through the Internet. User interaction with such cell phones is illustrated at, for example, 
http://mobile.vahoo.com/wireless/tour?.pv=vp&.pg==3&.ph=tp. 

15 To the Applicant' s knowledge, cell phones do not have the ability to execute 

RealPlayer software (available from www.real.com) that is commonly used in personal 
computers. Specifically, a computer programmed with the RealPlayer software can 
playback audio (or video) that is stored locally in the computer, as well as playback a clip 
that is being played in real-time over the Internet (also called "streamed content") and 

20 that is being buffered locally. The RealPlayer provides access to continuous real-time 
sfreaming media from a variety of radio stations throughout the world (usually a 
combination of live and pre-recorded programs). The user can select a station with the 
Radio Tuner feature in the Radio menu. Opening the Radio Tuner takes the user to a site 
at Real.com. By following the links, a user can select and play radio broadcasts. The user 

25 can also search for stations by their name/call letters, or browse through available stations 
sorted by category. 

The RealPlayer also provides access to "channels" that provide one-click access 
to content that is updated on a frequent basis. Specifically, a service called "My 
Channels" provides the user with up-to-date headlines from all of the user's selected 
30 Channels when connected to the Internet. Headlines for each Channel appear to the right 
of the Channel's icon as the user moves the cursor over the Channels. The user can also 
set the Headlines to display and scroll automatically. Moreover, headlines update 
automatically while the user is connected to the Internet. 

-6- 

686944 v2 



EXPRESS MAIL LABEL NO. 

EL 710208052 US 
M-9389 US 

The RealPlayer includes a "Search" feature that offers the user the opportunity to 
type in words or phrases of interest and looks for streaming media related to those 
subjects. Search returns a results page similar to those returned when the user searches 
for Web pages. The RealPlayer also includes a "Guide" feature that takes the user to a 
5 media hub for free hitemet audio and video software and programming. At this website, 
the user can find programming featured on the hitemet and play it with one click. The 
user can choose from more than 2,500 radio and television stations, 8,000 Web sites and 
500 daily live events. To the Applicant's knowledge, the RealPlayer is limited to 
computers, and is not available for cell phones. 

10 

SUMMARY 

A system and method in accordance with this invention visually display on a 
screen (also called "monitor") of a telephone handset (such as a cellular phone supporting 
Internet access), descriptions of audio contents that are stored in a computer readable 

1 5 storeage mediima and that are available for selection by the user. On selection of an 
audio content description, the handset places a voice call to a computer that plays the 
audio content to the user during the voice call. The telephone handset uses a data 
connection to retrieve the description(s) for visual display, but this data connection is not 
used for retrieval of a file containing the audio content. Instead, a voice call is placed in 

20 the normal telephony manner, and the audio content is played back to the user by the 
computer that receives the voice call. The system and method are implemented by a 
single business that (1) makes accessible to telephone handsets the descriptions of audio 
contents and also (2) plays the selected audio content over the voice call. 

In one implementation, the telephone handset conforms to the WAP protocol, and 

25 the audio content descriptions are provided as WML cards. Such WML cards may 

identify a number of audio contents by description (either in text or graphics), and also by 
one or more corresponding telephone numbers that may be used to place a phone call to 
play the respective audio contents. Therefore, at the time of visual display of the audio 
content descriptions, the telephone handset already contains at least one phone number 

30 that is to be dialed. A user's selection of an audio content causes the handset to dial such 
a phone number (either automatically or after the user approves such dialing by pressing 
a button on the handset), thereby to set up a voice call for playback of the audio content. 
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In one embodiment, the computer assigns, ahead of time, a unique phone nimiber 
to each audio content that is available for playback. Assignment of imique phone 
numbers is done independent of the users and remains static for a long period of time, 
e.g. a few days, hi an alternative embodiment, all of the available audio contents are not 
5 identified by unique phone numbers ahead of time and instead one or more phone 
numbers are shared across multiple users, hi this alternative embodiment, the method 
and system keep track of which descriptions are being provided to which users (that are 
currently navigating the available audio contents) at any given time. This procedure is 
hereinafter called "maintaining state." In one implementation, the same phone number is 

10 dialed by all users. When a phone call comes in, for playback of an audio content, the 
computer identifies the user fi-om information provided by the handset (e.g. via subscriber 
identifier, such as the MSN and/or ESN that are programmed into each cell phone, or via 
caller id), and the appropriate audio content is identified based on the user's state that is 
being maintained. In another implementation, one or more phone mmibers are assigned 

15 dynamically (e.g. several times a day or on demand) to audio content descriptions that are 
to be displayed to users. 

Depending on the embodiment, a computer process (also called "audio playback 
process") that plays an audio content in response to a voice call initiated from the handset 
may be integrated with or separate fi"om another computer process (also called "audio 

20 description process") that initially provides the audio content descriptions. When 
different, the two processes are coupled to one another to exchange information 
therebetween. The two processes may be executed in the same computer or in different 
computers that are linked to one another. 

25 BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 illustrates, in a flow chart, a method of playing audio content via voice 
calls initiated fi-om visual navigation in one embodiment of the invention. 

FIG. 2 illustrates, in a block diagram of a cellular phone, a Hst of descriptions of 
audio content available for selection by the user, when performing the method of FIG. 1. 
30 FIGs. 3A and 3B illustrate, in block diagrams of the cellular phone, visual 

navigation by browsing to obtain the Hst illustrated in FIG. 2. 

FIG. 4 illustrates, in a block diagram of the cellular phone, a form for entry of a 
search term to be used to obtain the list illustrated in FIG. 2. 
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FIG. 5 illustrates, in a flow chart, operations performed by the cellular phone of 
FIG. 2 in one embodiment. 

FIG. 6A illustrates, in a system level block diagram, interconnections between the 
cellular phone of FIG. 2, and one or more servers that provide text content for visual 
5 navigation, and audio content during voice calls in response to the operations illustrated 
in FIG. 5. 

FIG. 6B illustrates, in a device level block diagram, various components in the 
cellular phone of FIG. 2, and their use in forming the two kinds of connections. 

FIGs. 7 and 8 illustrate, in flow charts, two alternative implementations of the 
1 0 method illustrated in FIG. 5. 

FIG. 9 illustrates, in a flow chart, acts performed during the operation of placing 
voice calls illustrated in FIG. 5. 

FIG. lOA illustrates, in a flow chart, operations performed by a server to provide 
descriptions for use during visual navigation as described in reference to FIG. 8. 
1 5 FIGs. 1 OB-lOE illustrate tables in a database for use with the method described in 

FIG. lOA. 

FIG. 1 1 illustrates, in a flow chart, supply of audio content in response to 
placement of a voice call as illustrated in FIG. 8. 

20 DETAILED DESCRIPTION 

In accordance with the invention, a system and method make available to the 
users of cell phones, a number of descriptions of audio content and on selection of one of 
the audio contents, supply the selected audio content through a voice call. Therefore, in 
accordance with the invention, the user uses (as illustrated by operation 101 in FIG. 1) a 

25 cellular phone 110 (FIG. 2) to visually navigate the descriptions in list 111 that describe 
audio contents available for playback. 

In one embodiment, a business supplies to cell phones a list of categories (e.g. as 
illustrated in FIG. 3B) and imder each category a list 111 of descriptions of audio 
contents all through a data connection, and a user visually navigates through such 

30 descriptions to make a selection. The user may select (as illustrated by operation 102) an 
audio content described by a headline 112 (see FIG. 2) for playback. Next, the same 
business that supplied the descriptions also plays the selected audio content on the user's 
phone 1 10, in response to a voice call placed by the user's cell phone 1 1 0. The user 
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listens (as illustrated by act 103 in FIG. 1) to the playback of the selected audio content 
through speaker 1 13 of the cellular phone 110 (FIG. 2). The combination of a cell 
phone's visual interface for navigation, and the cell phone's audio interface for playing of 
audio content provides the benefits of both: the ease of navigation provided by web 
5 pages and the quality of audio playback provided by a voice call of the type normally 
placed by a telephone handset. 

In one implementation, such a business generates descriptions in list 111, and 
either generates the audio contents or licenses the audio contents for playback to the cell 
phone users. For example, the audio contents may be Hcensed from an established news 

1 0 service, such as BBC. Therefore, depending on the implementation, the business may 
create, modify, and/or combine or split up audio contents (e.g. may break up the hour 
long news report "BBC World Service" into individual news stories, and may make up a 
headline for each news story). 

Depending on the embodiment, the audio signal may be either modulated directly 

15 on to a carrier or converted to digitized samples (e.g. at about 8000 samples per second) 
and transmitted as 1 s and Os. If a wireless link is used, the same link may be shared by 
the data coimection and by the voice call, with data transfer suspended during the voice 
call and resumed on completion of the voice call (when the computer finishes playing the 
audio content). Therefore, such use of the same link, for the voice call and for the data 

20 connection, is mutually exclusive. Alternatively, depending on the implementation, a 
wireless link may be set up and torn down for each of: voice call and data connection, 
and in such an implementation the user may be billed at different rates for the voice call 
and for the data connection. 

Note that the voice call of one embodiment is placed in the normal telephony 

25 manner, and is therefore different from a "Voice Over IP" call (wherein IP is an 

abbreviation for Internet Protocol). This is because a data coimection is not used (in this 
specific embodiment) for transfer of the audio contents that are played by the user's 
phone 110, unlike the Real Player software that uses the TCP/IP coimection to transfer 
audio files. 

30 Although cellular phone 110 is illustrated m FIG. 2, any other handheld device 

having a monitor for visually displaying a list of audio content descriptions, a speaker for 
playing of selected audio content, telephone circuitry for placing a voice call, and a 
modem for forming a data connection to obtain the descriptions can be used in 
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accordance with the invention. Depending on the embodiment, the modem may form the 
data connection over the voice call, or form the data connection independent of the voice 
call (e.g. over a Bluetooth link as in Ericsson R320 cell phone). Examples of other 
handsets include, for example, a personal digital assistant (PDA) or a handheld personal 
5 computer. 

Depending on the type of cellular phone or other telephone handset, user approval 
may be required for placement of a voice call (as illustrated by operation 104 in FIG. 1) 
between the acts of selecting the audio content and listening to the selected audio content. 
The approval in operation 104, and the selection of audio content in operation 102 may 

10 be performed in the normal manner for example by touching an "OK" button 115 (FIG. 
2), or even via voice commands from the user if the system or if cell phone 110 includes 
voice recognition fimctionahty as described in, for example, the '295 application 
incorporated by reference above. 

Also depending on the embodiment, a handset of the type described herein may 

15 make the voice call either over a land line or over a wireless link. One example of a 
handset that uses a land line for telephony and another for Internet access is iPhone 
(available from www.bigplanet.com), which is described as an integrated telephone and 
Internet access device, with a built-in touch screen, keyboard, modem and software. In 
another example, a handset that uses one wireless link to handle both a voice call and a 

20 data connection is the pdQ available from www.kyocera-wireless.com . 

List 111 (FIG. 2) displayed on monitor 1 14 can be obtained by a user in the 
normal manner, for example by browsing (see FIGs. 3A and 3B), or by searching (see 
FIG. 4). Specifically, in FIG. 3 A, a server that provides the audio contents has a website 
that lists a niimber of categories, such as, news, entertainment, reference and voice mail. 

25 On selection of one of these categories, e.g. the news category, a number of subcategories 
are displayed on monitor 114 (FIG. 3B). The user can select one of the subcategories, 
e.g. CNN Headline News, to obtain list 111 illusfrated in FIG. 2. Alternatively, as 
illustrated in FIG. 4, the user may search for items of interest by entering a search term, 
thereby to obtain list 111 of FIG. 2. Therefore, the user obtains list 111 that describes a 

30 nimiber of audio contents available for playing through speaker 1 13 by visual navigation 
of various choices displayed on monitor 114. 

Depending on the embodiment, the visual navigation by the user may be assisted 
by personalization software that may, for example, display lists of categories, 
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subcategories or descriptions of audio content in a prioritized manner, based on the user's 
past beliavior and/or the user's expressed preferences, as described in, for example, the 
M-9477 application incorporated by reference above. Also depending on the 
embodiment, the audio contents may be generic in the sense that any user can access the 
5 same audio contents, or the audio contents may be specific to the user in the sense that 
only a particular user can access the audio contents. Examples of generic audio contents 
include news and weather reports. Examples of the specific audio contents include voice 
mail messages that are accessed via visual navigation as described herein. 

In one embodiment, a microprocessor 139 (also called "WAP microprocessor") in 

10 cell phone 110 (see FIG. 6B) uses a data connection (see operation 121 in FIG. 5) to 

receive list 11 1 of the descriptions of audio content that are available for playing through 
speaker 113. List 111 may be received as a portion of one or more WML card(s) that are 
stored in a memory 137 of cell phone 110 and that are interpreted (i.e. executed) by WAP 
microprocessor 139. Therefore, in act 122, microprocessor 139 visually displays list 111 

15 on monitor 114. Thereafter, cell phone 110 waits (see operation 123 in FIG. 5) to receive 
from the user a selection of one of the displayed descriptions. 

In response to user selection, cell phone 110 does not use the data connection 
described above in reference to operation 121 to retrieve the selected audio content. 
Instead, microprocessor 139 (under control of instructions in memory 137) causes a 

20 transceiver 138 (also called "voice call transceiver") in cell phone 1 10 to place a voice 
call (see operation 124 in FIG. 5), for playing the selected audio content through speaker 
113 (FIG. 2). Transceiver 138 includes circuitry that is normally used to place a voice 
call, e.g. in case of digital wireless, an analog-to-digital converter to convert a sound 
signal from a microphone in to digital form for transmission by antenna 119, and 

25 similarly audio in digital form received from antenna 1 19 is converted into analog form 
by a digital-to-analog converter for supply to speaker 113. Transceiver 138 also includes, 
in case of a cell phone, the wireless circuitry normally used to set up and tear down voice 
calls over a cellular telephony network. 

Thereafter, microprocessor 139 in cell phone 110 waits (see operation 125 in FIG. 

30 5) for the voice call to be completed, and on completion of the voice call returns to act 
121 (described above). Note that during the voice call, the WAP microprocessor 139 is 
not used at all. Instead, only the voice call transceiver 138 is used, and performs the task 
of receiving the audio and playing the audio through speaker 1 13, in the normal manner 
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of processing a voice call. Specifically, the audio signal received from antenna 119 is 
passed by the voice call transceiver 138 directly to speaker 113. In contrast, when using 
the data connection to download descriptions 111 for visual display, WML 
microprocessor 139 is used to execute instructions in the WML card(s) that are 
5 temporarily held in memory 137. 

As the audio played on speaker 113 is obtained through the normal telephony 
voice call in this embodiment, the quahty is significantly better than the quality of a voice 
over IP telephone call. For this reason, in this embodiment the audio is played on 
speaker 113 as soon as call setup is completed (e.g. instantaneously), and is played 

10 continuously until the end without any interruptions. There is no "streaming" or 
"buffering" of the type normally performed by the Real Player (see www.real.com). 
Therefore, there is no initial delay caused by Internet in the audio playback of this 
embodiment. Also in this embodiment, there is no delay during the call, e.g. due to 
"congestion" on the Internet. The above-described visual navigation of choices 

1 5 eliminates the prior art need for a user to navigate through a set of voice prompts to 
identify an audio clip to be played, e.g. as required by a voice mail system. Instead of 
using voice recognition as described above, a different telephone handset may use a 
different input mechanism for the selection of audio content description during visual 
navigation, e.g. a touch screen. 

20 The above-described distinction between using a data connection to obtain the 

text content for visual navigation, and using the voice call to provide the audio content 
played on speaker 1 13 is a critical aspect of this embodiment. For this reason, FIG. 6A 
illustrates data coxmection 126 separate and distinct from voice call 127 although in this 
embodiment cell phone 110 uses the same wireless medium to transfer both contents. 

25 Note that in other embodiments, there may be no such distinction, e.g. a data connection 
may be used to obtain the text content and the same data connection may be used to carry 
a voice call, e.g. as in Voice Over IP. 

In one specific implementation, cellular phone 110 forms a data connection with 
computer 131 that provides the hst of descriptions from database 132. Computer 131 is 

30 equipped with one or more modem(s) (not labeled) or routers that are connected to the 
Internet, in the normal maimer. Computer 131 may include a database as described 
below. Cellular phone 1 10 places the voice call to another computer 135 that plays the 
audio contents from another database 136. Computer 136 is equipped with one or more 
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telephone call processing circuits (such as the D/41H board available from Dialogic 
Corporation of Parsippany, New Jersey) and driver software for use of such circuits. The 
call processing circuits (not labeled) of computer 136 are connected to the telephone 
network 128 in the normal manner. 
5 Note that in other implementations computers 131 and 1 35 may be replaced with 

a single appropriately programed computer or hard wired circuitry or some combination 
thereof that performs the functions of both computers 131 and 135. Moreover, such 
functions may be performed by two or more separate processes in the single computer, or 
all functions may be performed by a single integrated process. For this reason, the term 

10 "logic" is used to refer to a computer executing a group of software instructions or a 
portion of hard wired circuitry or some combination thereof that performs a specified 
function (such as one logic providing an identifier of an audio content to another logic). 
Regardless of how many logics are used to support method 129 of FIG. 5, all such logics 
are operated by the same business, so that they interoperate seamlessly with one another. 

15 Use of a cell phone 1 10 or other handheld device as described herein has the 

advantage of providing the most current content (e.g. news), as compared to, e.g. the Rio 
600 (available from S3.Lic. of Santa Clara, CA) that merely stores and plays back MP3 
files. Moreover, cell phone 110 or other handheld device uses a voice call directly, 
because the audio is played at the other end of the voice call by a computer, as compared 

20 to Rio 600 which uses the TCP/IP stack to download the MP3 files. In two different 
implementations of method 120 (FIG. 5), the audio contents are either assigned phone 
numbers ahead of time and such assignments are kept static, or alternatively the phone 
numbers are assigned about the time of selection of audio contents in a dynamic manner. 
In case of the static assignment of phone numbers, if two separate computers 131 and 135 

25 are used, the computer that makes the assignments (either of 131 and 135) provides a list 
of phone numbers and the corresponding audio contents to the other computer, e.g. 
through a database that may be commonly accessed by the two computers. In case of 
dynamic assignment, computer 131 that provides the audio content descriptions also 
provides the state of the user at the time of selection to computer 135 so that each 

30 incoming call is appropriately matched to the audio content selected by the user that 
initiated the incoming call. 

When a static assignment of telephone numbers is being used, cell phone 110 may 
implement method 140 (FIG. 7). Specifically, in act 141, cell phone 110 uses data 
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connection 126 to receive descriptions of audio content and also the corresponding 
telephone numbers that are to be dialed for playing of the audio content. Thereafter, in 
act 142, cell phone 1 10 displays the received descriptions to the user. Depending on the 
variant, cell phone 110 may or may not display the corresponding telephone numbers. 
5 Thereafter, in act 143, cell phone 1 10 receives from the user a selection of one of the 
descriptions when the user presses button 115 (see FIG. 2). Alternatively, a user may 
simply select the telephone number if such phone number is displayed in act 142 (FIG. 7) 
as described above. Next, in act 144, cell phone 110 places a voice call using the 
telephone number of the selected description, thereby to play the selected audio content. 

10 Next, cell phone 110 waits for the voice call to be completed in act 145, which may 
happen either on completion of playing of the audio content by computer 135 or in 
response to the user touching the "call" button 1 18 (to terminate the voice call). 

Method 140 (FIG. 7) can also be performed with the dynamic assignment of 
telephone numbers to descriptions in list 111, immediately prior to the transfer of these 

15 descriptions to cell phone 110. For example, if five descriptions of news clips are 
displayed in list 111, five imique telephone niraibers may be dynamically assigned to 
identify each of the five different news clips. In such a case, the state of each user is 
maintained, so that computer 135 is informed as to which particular phone number 
corresponds to which particular audio clip to be played for this particular user. 

20 Therefore, in such an example, other users can simultaneously receive the same five 

phone numbers even if the corresponding audio clip descriptions being presented to them 
are different (e.g. top five hit songs), because their states uniquely identify such 
descriptions to computer 135. Such dynamic assignment of telephone numbers to the 
descriptions that are to be displayed to each user has the advantages of reusing the 

25 telephone numbers across multiple users, and also using fewer telephone numbers than 
would be otherwise required if a imique phone number is to be assigned to each audio 
content. 

Regardless of whether the telephone number assignment is static or dynamic, the 
card received in method 140 by cell phone 110 has the following code in one example. 

30 
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<wml> 
<card> 

<a href="wtai://wp/mc;5551 101"> L ARAB LEADERS BLAST ISRAEL</a> 
<a href^"wtai;//wp/mc;555 1 102"> 2. KRUSK SALVAGE HAMPERED</a> 
5 <ahref="wtai://wp/mc;5551103">3. HILLARY WINS ENDORSEMENT</a> 

<a hre:f="wtai://wp/mc;555 1 104"> 4. GE TO BUY HONEYWELL</a> 
<a href="wtai://wp/mc;555 1 105"> 5. YANKEES WITHSTAND METS</a> 

</card> 
</wnil> 

10 

In one implementation, the above-listed code is received in a single message. In 
an alternative implementation, only the descriptions are received in a message (e.g. in one 
card) and only the phone numbers are received in another message (e.g. in another card), 
and the phone numbers are related to their respective descriptions by identifiers, e.g. the 

15 numbers 1-5 in the above-listed code may be used as such identifiers. In the alternative 
implementation, such an identifier is passed fi-om one card to another card, to identify the 
phone number to be dialed. 

When cell phone 110 executes the above WML code, the moment a user selects a 
description, the corresponding telephone number is automatically dialed. For example, if 

20 the user selects "HILLARY WINS ENDORSEMENT," immediately the number 

555-1 103 is dialed by the cell phone. Note that instead of WML, another language, such 
as "i-mode" compatible HTML as defined by DoCoMo of NTT, at for example 
www.nttdocomo.com can also be used in the manner described herein, with an 
appropriate instruction (also called "tag") to dial a specified phone number. 

25 In another embodiment, method 150 (FIG. 8) implements the method 120 

illustrated in FIG. 5 by not providing to cell phone 110 a list of telephone numbers that 
correspond to descriptions in list 111. Instead, as soon as the user selects one of the 
descriptions in list 111 cell phone 110 transmits the selection over the data connection to 
computer 131 that in turn responds by providing a telephone number to be dialed for 

30 playing of the selected audio content. Specifically, in act 151 (FIG. 8), cell phone 1 10 
receives only the descriptions of audio content through data connection 126, and does not 
receive any telephone numbers, but receives identifiers (such as the number "10000001" 
for the category CNN as illustrated in news.WML of Appendix A). Thereafter, in act 
152, cell phone 110 displays the received descriptions as list 1 1 1 on monitor 114 (FIG. 

35 2). Next, in act 153, cell phone 1 10 receives from the user, e.g. via operation of scroll 
button 1 16 to move cursor 112 over one of the five descriptions, and ok button 1 15, to 
indicate a selection of the highlighted description in list 111. Next, cell phone 110 
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transmits (in act 154 of FIG. 8) an identity of the selected description, e.g. transmits the 
number 1000001 over data connection 126 (FIG. 6A), e.g. by execution of the "DoPlay" 
card (see file "news. WML" in Appendix A): 

5 <card id="DoPlay"> 

<onevent type="onenterforward"> 
<gohre^"http://63.199.168.230/cgi-biB/prepPlayer.pl?subcatId=$(subCatId)&retF 
=news.wml&retC=News" /> 
</onevent> 
10 </card> 

In the above "href statement, the file "prepPlayer.pl" (see Appendix B) identifies 
a PERL script to be played by computer 131 and "subCatId" represents the value of the 
selection made by the user that is passed to the PERL script. In the above statement, the 

15 argument retF identifies the WML card "news.wml" to be loaded into cell phone 110 on 
completion of the audio playback and argument retC identifies an argument "News" that 
is to be passed to the WML card. Argiraients retF and retC are not critical to the 
implementation and may be skipped in other implementations. 

Next, in act 155, cell phone 110 receives over data connection 126 a telephone 

20 number that is to be dialed for playing of the audio content described in selection 1 12. In 
response, cell phone 1 10 places a voice call (see act 156 in FIG. 8) using the telephone 
number received over the data connection 126. The voice call is placed independent of 
the data connection as described above. Specifically, if the same wireless link is shared 
by the data connection and the voice call, the data connection is suspended for the 

25 duration of the voice call and is resumed on voice call completion. If different links are 
used, voice call and data coimection may be both active simultaneously. If the voice call 
is completed (see act 157) cell phone 110 returns to act 151, e.g. to allow the user to 
continue the visual navigation as described above. If the voice call is not completed, cell 
phone 110 goes to act 158 to wait for the voice call to be completed. 

30 Note that placement of voice calls in acts 124, 144 and 156 described above can 

be performed in one of two different ways, hi one implementation, the voice call is 
placed by automatically dropping the call currently used for data connection and dialing 
the telephone number (see act 161 in FIG. 9) without any user involvement. Therefore, 
in such an implementation, the user is not even aware that a voice call is being placed and 

35 instead simply hears the sound played through speaker 113 (FIG. 2). Alternatively, in 
another implementation, cell phone 110 displays on monitor 1 14 the telephone number 
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that is to be dialed for playing of the selected audio content. Thereafter, cell phone 110 
waits to receive the user's approval (see act 163 of FIG. 9). On receipt of the user 
approval, cell phone 110 automatically drops the data connection and dials the telephone 
number in act 164 (because data connection and voice call are mutually exclusive in this 
5 embodiment, although both may be simultaneously used in other embodiments). 

Therefore, in the alternative implementation, the user is made aware of the telephone 
number that is about to be dialed, and must approve the dialing of such telephone 
number. Note however that even in this alternative implementation, the user is not 
required to touch any of numeric keys 1 17A-1 17N (FIG. 2) to physically dial the 

10 telephone number. 

In one embodiment, a process in computer 131 performs method 170 (FIG. lOA). 
Specifically, in act 171, computer 131 waits for a request fi-om a cell phone, such as 
phone 110. Thereafter, in act 172, computer 131 checks to see if the request from cell 
phone 1 10 is asking for audio content. If not, computer 131 goes to act 173 and provides 

1 5 the descriptions of audio content or other such descriptions in the normal manner of a 
web server, such as the Apache server (or Microsoft's Internet Information Server) that 
serves web pages. The web pages (and the WML card(s) contained therein) being 
provided by the server may be accessed by the user either via browsing or via searching 
in the normal maimer. Thereafter, computer 131 returns to act 171 to wait for the next 

20 request. In act 172 if the request is for audio content, e.g. request identifies PERL script 
prepPlayer.pl as discussed above in reference to cell phone 1 10 in act 154 (described 
above in relation to FIG. 8), computer 131 goes to act 174 (FIG. lOA). 

In act 174, computer 131 saves the subscriber identifier of the user, and an 
identifier of the selected audio content, e.g. the nimiber 100001 . In the illustrative 

25 implementation ofprepPlayer.pl, computer 131 identifies the audio content file to be 

played to the user via an SQL query to database 132 (FIG. 6A) that contains a number of 
tables (FIGs. lOB-lOE), e.g. as illustrated in detail in Appendix D. In database 132, a 
field "body_file" in table "content" (FIG. lOB) contains the path name of the file 
containing the audio content to be played, e.g. if the "body type" field has the value "A" 

30 thereby to indicate that this file contains audio (another value "T" indicates that the file 
contains text). This table also contains a "description" field that identifies a description 
1 12 (FIG. 2) of the type described above. 
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Note that in this implementation, the content table (FIG. lOB) may contain a 
number of records that identify a corresponding nimiber of versions of contents that fit 
the same "description" (e.g. if there is an hourly news feed, there may be 24 such records 
for an entire day). Such multiple records are differentiated by the value in the field 
5 "crawl_id" in table "content" which is incremented when creating each record in the 
table. A corresponding "crawl_id" field in another table "subcategory" (FIG. IOC) 
identifies the largest value, and is used by the SQL query in prepPlayer.pl to identify the 
latest audio content. 

The subcategory table (FIG. IOC) also contains a "description" field that 

10 identifies the type of audio contents in a subcategory, e.g. audio contents that are 

"editorial" or "gossip" as in a newspaper, or more specifically as "CNN News." The 
subcategory table further contains a "prompt_file" field that identifies the path of a file of 
audio content that is to be played to provide the user with context (e.g. the audio contents 
may contain the spoken words "CNN News"). The subcategory table also contains a 

15 "category_id" field that identifies, in the hierarchy of menus, as to which category the 
subcategory described by this record belongs. For example, the subcategory of "CNN 
News" is identified as belonging to the "News" category by the value "1" that identifies a 
unique record in the "category" table (FIG. lOD). 

The category table (FIG. lOD) also contains a "description" field that identifies 

20 the type of subcategories, and also a "prompt file" field that identifies a file containing 
audio content to be played, again to provide context to the user. The user is identified by 
records in a user table (FIG. lOE) that contains a "subscriber_id" field that is used in this 
embodiment to uniquely identify the user based on their WAP session. Note that 
depending on the embodiment, other mechanisms may be used to identify a user. 

25 Returning to the method 170 (FIG. lOA), after performing the database query, 

computer 131 saves such information, for example in a file called "state file" that 
identifies the audio file to be played to each user by computer 135. Alternatively, such 
information may be saved in a record of a shared database that is accessible from both 
computers 131 and 135. For example, see the "open", "print" and "close" statements in 

30 prepPlayer.pl in Appendix B. Note that although in this example a fixed file name 

"Contentl.txt" was used, if multiple users are to be serviced, the number "1" in the file 
name is to be replaced by the user's subscriber ID or user's cell phone number. Note that 
computer 135 that receives this file uses the user's subscriber ID or user's cell phone 
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number from an incoming call to identify the relevant file to be used in playing the audio 
contents. 

Alternatively, if each user is assigned a different phone number for calling in by 
computer 13 1 , the phone number itself may be used in the file name; and computer 135 
5 identifies the file firom the phone number dialed by cell phone 110 (the dialed phone 
number is known by computer 135 because as noted below, computer 135 waits on one 
or more such telephone numbers at the corresponding ports of a call processing board). 
Note that instead of calling a local number, cell phone 110 may call a "1-888" or "1 -800" 
number that is translated in the normal manner into a corresponding local number that is 

10 being used by computer 135 to play audio. Therefore, in another embodiment, the phone 
number dialed by cell phone 110 is not the same as the phone number on which computer 
135 is waiting, but is translated therefi-om, e.g. when a call comes in firom cell phone 110. 

In act 175, computer 131 uses an available telephone number to create a file of 
instructions to be executed by cell phone 110 (e.g. generates a wml card as illustrated by 

15 the "print«END" statement and the rest of the text to the end of the PERL script 

prepPlayer.pl). Next, in act 176, computer 131 sends the created file to cell phone 110 
and returns to act 171 (which may happen implicitly on completion of the PERL script, 
for example). 

Computer 131 removes the telephone number fi-om, e.g. a list of available 
20 telephone nimibers. At a later time, e.g. after audio playback for this telephone number, 
computer 135 makes the telephone number available, e.g. adds the telephone number to 
the hst. Computer 135 may also make the telephone number available after a 
predetermined time, e.g. in case an incoming call for this telephone number never arrives. 
The example illustrated in prepPlayer.pl of Appendix B generates the following 
25 code in the card DoCall for execution by cell phone 110: 

<card id="DoCall" oiitimer="http://63 . 1 99. 168.230/wiid/$retumToFile\#$retumToCard''> 
<oneveiit type="onenterforward"> 
<go liref=" wtai://wp/mc; 1 65 0652643 1 "/> 
30 </onevent> 

<timer id="callWait" value-' !"/> 
</card> 

In the above code, the "onevent" tag indicates that the telephone number 
35 1650652643 1 is to be called when the event "onenterforward" occurs. So, as soon as ceil 
phone 110 receives this code, the received phone number is immediately dialed. On 
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completion of the phone call, the "callWait" timer causes cell phone 1 10 to wait for upto 
one-tenth second, depending on the embodiment. The amoimt of time depends on the 
implementation of cell phone 110, e.g. the timer may be started as soon as the card is 
loaded. On expiration of the timer, cell phone 110 goes to the URL provided by the 
5 "ontimer" variable, using as variables the values of retF and retC (described above). 

In the above-described embodiment, computer 135 performs a method 180 (FIG 
11) to provide audio content over a voice call 127 to cell phone 110 (FIG 6). 
Specifically, in act 181, computer 135 waits for a call. For example, computer 135 may 
wait on one or more telephone numbers at the corresponding ports of a call processing 

1 0 board that are to be used by cell phones 110. In the illustrative implementation shown in 
the attached Appendix C, see the statement "sc_wait(line)" in the file waptest.vs. 

Depending on the implementation, a number of cell phones 110 may use the same 
telephone number (e.g. if a himting group for multiple ports (to allow call connection to 
an unused port) has been established by arrangement with local telephone provider in the 

1 5 normal manner). Alternatively, computer 135 may wait for a call on any one of a number 
of telephone numbers that are reserved for each of or corresponding number of cell 
phones 110 that have requested audio content. 

Next, in act 182, as soon as voice call is received, computer 135 determines the 
subscriber ID of the cell phone 110 that initiated the telephone call. For example, 

20 computer 135 may look at the caller id information accompanying the voice call. 

Alternatively, computer 135 may be waiting on a phone number that was assigned to a 
user and that was identified, e.g. in a file name by computer 131 as described above in 
reference to act 175 of FIG. 10. In yet another embodiment, a database lookup is 
performed to match the incoming phone number against a record that contaisn a user's 

25 identity, e.g. if computer 131 stored such information in the database. 

Then, in act 183 (FIG. 1 1), computer 135 uses the state file or other information 
fi-om computer 131 (e.g. fi-om a database) to determine, based on the subscriber ID, the 
audio content that is to be played for the voice call received in act 182. In the illustrative 
implementation in the file waptest.vs, see the statement "fileHandle =...". Next, in act 

30 184, computer 135 starts a task that plays the audio content over the voice call 127 so that 
the audio content becomes available almost instantaneously via speaker 1 13 of cell phone 
110 (FIG 2). In the illustrative implementation in the file waptest.vs, see the statement 
"sc_play." While the task is running, computer 135 waits in act 185 for completion of 
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the task or alternatively for the user to hang up on the voice call 127. In the illustrative 
implementation in the file waptest.vs, see the "while" loop. 

In act 186, computer 135 checks to see if the task has completed. If so, computer 
135 goes to act 187 to hang up on (e.g. tear down) voice call 127 and thereafter returns to 
5 act 181 to wait for another voice call. If the audio task has not completed in act 186, 
computer 135 goes to act 188 to terminate the task and thereafter returns to act 181. 
Instead of spawning a new task as described above in reference to 184, computer 135 
may perform playing of the audio content in line in the current task, thereby to eliminate 
the overhead of spawning a new task. In such a case, computer 135 does not wait for 
10 completion of the audio playback and instead is responsive to the hang-up event of voice 
call 127 by cell phone 1 1 0. Such responsiveness can be implemented either by polling or 
by interrupt. 

The to-be-played audio contents may be retrieved by computer 135, either 
through the Internet in real time (e.g. before the descriptions are provided to any users) or 

1 5 may be cached ahead of time for supply to users on demand. The to-be-played audio 

contents may also be captured from a live broadcast by television or radio studio that may 
be accessed through the Internet or in other conventional manner. Also, descriptions of 
the audio contents may be prepared manually, or pre-existing descriptions may be 
downloaded from Internet at the same time and from the same location as the audio 

20 contents (e.g. see website www.on24.com and website www.real.com). 

Numerous modifications and adaptations of the embodiments described herein ill 
be apparent to the skilled artisan in view of the disclosure. Although in one embodiment, 
list 114 (FIG. 2) is limited to only audio content descriptions, in another embodiment 
such descriptions are interspersed among other descriptions, e.g. of text content that is 

25 available for visual display on selection. In such an embodiment, an icon (of a speaker) 
may be displayed to identify whether the content is text or audio or both, e.g. in the 
manner ofwww.cnn.com. Moreover, although certain software in the attached 
appendices is described for an illusfrative embodiment, other embodiments will be 
apparent to the skilled artisan in view of the disclosure. Numerous such modifications 

30 and adaptations of the embodiments, variants and implementations are encompassed by 
the attached claims. 
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